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ABSTRACT 

Bacterial spores are a continuous problem for both 
food-based and health-related industries. Decades 
of scientific research dedicated towards under- 
standing molecular and gene regulatory aspects of 
sporulation, spore germination and spore properties 
have resulted in a wealth of data and information. 
To facilitate obtaining a complete overview as well 
as new insights concerning this complex and tightly 
regulated process, we have developed a database- 
driven knowledge platform called SporeWeb (http:// 
sporeweb.molgenrug.nl) that focuses on gene 
regulatory networks during sporulation in the 
Gram-positive bacterium Bacillus subtilis. Dynamic 
features allow the user to navigate through all 
stages of sporulation with review-like descriptions, 
schematic overviews on transcriptional regulation 
and detailed information on all regulators and the 
genes under their control. The Web site supports 
data acquisition on sporulation genes and their 
expression, regulon network interactions and 
direct links to other knowledge platforms or 
relevant literature. The information found on 
SporeWeb (including figures and tables) can and 
will be updated as new information becomes avail- 
able in the literature. In this way, SporeWeb offers 
a novel, convenient and timely reference, an infor- 
mation source and a data acquisition tool that will 
aid in the general understanding of the dynamics 
of the complete sporulation cycle. 

INTRODUCTION 

During adverse environmental conditions, bacterial cells 
adopt developmental strategies, such as endospore 



formation, to ensure their survival. Bacterial spores are 
a continuous problem in food- and health-related 
industries because of their persistence after treatments 
and their ability to revert to vegetative cells through the 
process of germination (1,2). For instance, the disease of 
anthrax can persevere through the ingestion of spores that 
are able to survive the gastrointestinal tract and germinate 
to vegetative ceUs that produce lethal toxins (3). On the 
other hand, use of bacterial spores in the form of 
bioinsecticides (4), antigen delivery systems and vaccines 
(5) or probiotics (6,7) are upcoming fields that offer at- 
tractive applications. Therefore, a better understanding of 
the sporulation and germination processes, the level of 
heterogeneity therein, aU genes and proteins involved, as 
well as influential effects of environmental factors have 
formed important fields of study for the past decades (8) 
and have provided a wealth of knowledge (9-12). 

Most of the work on sporulation has been performed 
using the Gram-positive non-pathogenic organism 
Bacillus subtilis. The obtained data are extremely 
valuable and are often used as a reference model in sporu- 
lation research concerning other (pathogenic) bacteria, 
including Bacillus anthracis (13,14), Bacillus cereus 
(15,16) and various Clostridium species and strains that 
are of both medical and industrial importance (including 
Clostridium difficile, Clostridium perfringens and 
Clostridium botulinum) (17-22). This results in even more 
data and information, with various theories and specula- 
tions on molecular mechanisms, conservation of 'core' 
sporulation genes and emergence of evolutionary founda- 
tions (23,24). Sporulation of B. subtilis is an extremely 
complex cellular developmental process (11,25,26). 
New technological advances such as RNA sequencing, 
identification of small non-coding RNAs and increased 
understanding of processes through mathematical 
modelling aUow us to answer questions beyond previous 
expectations (27-31), but simultaneously add to the 
complexity. Newly sequenced bacterial genomes of other 
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spore-formers are increasingly available due to faster and 
cheaper methodologies and demand efficient analyses and 
readily available databases for comparison purposes (32). 
Therefore, a general overview on how spore formation is 
established (including which genes and regulatory 
pathways are involved) is very valuable to the field, but 
due to the complexity and dynamics increasingly difficult 
to obtain. 

In this work, we describe a novel knowledge and data 
acquisition platform called SporeWeb (http://sporeweb. 
molgenrug.nl), which focuses on all developmental stages 
of sporulation of B. subtilis from a gene regulatory point 
of view. Through an interactive web interface querying the 
SporeWeb database (details available in Supplementary 
Material), it offers both a textual description and a graph- 
ical representation of the sequence of events throughout 
spore development (Figure 1). Importantly, it easily hnks 
to more catalogued information present on other know- 
ledge Web sites, such as SubtiWiki (http://subtiwiki.um- 
goettingen.de/) (33). This allows the reader to grasp what 
happens inside the cell on the regulatory level, with add- 
itional detailed information on key regulatory proteins 
involved. The database-driven SporeWeb Web site is 
dynamic and will be updated and extended when novel 
scientific data become available in the future. We believe 
that SporeWeb will be a continuous valuable asset to the 
research field of bacterial sporulation and will aid in our 



overall understanding of this complex developmental 
process. 

An interactive journey through all stages of the bacterial 
sporulation process 

Commitment to sporulation is characterized by asymmet- 
ric cell division and expression of dedicated gene sets (34). 
This expression is tightly regulated in various sequential 
developmental stages and governed by complex biochem- 
ical communication between the two compartments of the 
cell (11). SporeWeb offers an interactive review of this 
complete process, which is the result of an extensive lit- 
erature study. The homepage serves as a starting point for 
any sporulation stage of interest, which can be accessed by 
clicking on the homepage image or by selecting the 'State' 
in the menu bar. Subsequent pages offer both detailed 
descriptions and schematic representations of develop- 
ment. The figures are interactive and dynamic: they 
contain clickable items of interest and ensure updated in- 
formation as genes are added to or edited in the database. 
Legends to the figures are described in the vertical grey bar 
on the right of the web page, whereas a review-hke 
description of the sporulation state is shown below the 
figure, with direct hnks to relevant hterature references. 

Sporulation-specific regulators and their regulons are 
described on individual pages, which can be accessed by 




Figure 1. Graphical representation of the different levels in the SporeWeb Web site. 
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SpoIIID during commitment to sporulation 

The gene encoding SpoIIID is part of the regulon and encodes a DNA binding protein that acts 
both positively and negatively on a subset of o^-regulated genes. It plays an important role In 
the progression from stage III into stage IV of sporulation by enhancing the expression of genes 
required for the synthesis of pro-a"^ ^^"^ . Nevertheless, the majority of the a^-dependent genes 
co-controlled by SpoIIID are repressed by this secondary regulator thereby fine-tuning timing 
of protein synthesis during sporulation stages II and III in the mother cell. Through the action of 
SpoIIID, genes that are involved in the process of engulfment, spore cortex synthesis and the 
appearance of and are expressed in pulses that can time-wise be correlated and clustered 
together according to functionality For instance, short expression pulses of genes encoding 
some spore outer membrane proteins early in sporulation are necessary to ensure their proper 
localization before the engulfment process is complete. Also the production of some coat 
proteins not needed until the final stages of spore coat assembly is delayed by the action of 
SpoIIID 1". 



Figure 2. The 'SpoIIID during commitment' page on SporeWeb. Detailed information on tlie role of a sporulation-specific regulator during any 
stage in sporulation can be found on such pages. The schematic representation depicts its own regulation as well as the genes under its control; 
positively regulated in a blue box, negatively regulated in a red box. A detailed list containing all genes within the regulon can be accessed using the 
SubtiWiki link, or downloaded as an Excel file by clicking on the coloured boxes or the XLXS icon. To see what happens to this regulator during 
other stages in sporulation, simply click on the link provided in the grey bar on the right. 



clicking on the item in the schematic figures or via the 
menu bar. An example of such a page is shown in 
Figure 2. Genes under transcriptional control of a 
specific regulator have been indicated in blue or red 
boxes for transcriptional activation or repression, respect- 
ively. Additionally, there is a direct link to the SubtiWiki 
list of regulon members. Lists and descriptions of all genes 
within the regulon can be downloaded in the form of up- 
datable Excel sheets by clicking on the coloured boxes or 
via the Excel icon in the top right corner (Figure 2). 

Graphic representations of regulon interactions define 
subgroups of co-regulated genes 

As sporulation progresses, sporulation-specific sigma 
factors are expressed and activated in a spatial and 
temporal manner (11,26). Together with secondary regu- 
lator proteins, they control the timing, sequence and level 
of gene expression that are necessary for formation, 
maturation and release of the endospore. Various 
transcriptomic studies in B. subtilis have led to the identi- 
fication of genes controlled by these sigma factors and 



other regulators to map the sporulation gene regulatory 
networks (35^2). In SporeWeb, we have visualized 
these networks using Cytoscape-generated layouts for 
every sporulation-specific stage (details available in 
Supplementary Material) (43). These layouts can be 
accessed via the 'Cytoscape' option in the menu bar, or 
the Cytoscape icon on the top of every State page. 

An example of such a layout is shown in Figure 3A. 
This representation immediately shows which genes are 
under single, dual or even triple or quadruple control 
and which genes are co-regulated during a specific sporu- 
lation stage. There is a zoom-in function that allows the 
user to identify genes or regulators of interest. The name 
and direct hnks to the gene SubtiWiki and MicroScope 
MaGe pages (http://www.genoscope.cns.fr/agc/micro 
scope/home/) are provided when the node is clicked. 
Furthermore, recently published spore-specific gene clas- 
sification of the minimal sporulation gene set by Galperin 
et al. (24) has been integrated to immediately appreciate 
similarities and differences in regulation of functional 
classes of genes. Additionally, a visualization tool called 
'User Subset' has been implemented in the Cytoscape 
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Figure 3. Cytoscape-generated layouts on gene regulatory networks. (A) Active regulators during completion of engulfment are indicated by green 
squares (proteins) or blue hexagons (sigma factors). Positive or negative effects on the transcription of genes (coloured circles) are indicated by 
connecting blue and red lines, respectively. Important sporulation genes indicated as being part of the minimal sporulation gene set have been colour- 
coded according to their functional category as determined by Galperin et al. (24). Genes unassigned to these functional categories are indicated as 
yellow circles. These layouts are available on SporeWeb for five different stages during spore formation. (B) A personal Cytoscape layout on specific 
genes of interest can be generated using the 'User Subset' option in the 'Cytoscape' menu (indicated by a black arrow). Genes (separated by a comma 
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Figure 4. Heatmap representation of gene expression values during the engulfment state of sporulation. Tables like these are available on SporeWeb 
for five different stages in spore formation. Expression value data were derived from Nicolas et a!. (45). Activation of gene expression is indicated as 
positive values in blue boxes, whereas downregulation is indicated as negative values in red boxes. Genes are categorized in classes listed A-L 
according to their documented regulation. Expression values are shown throughout the complete sporulation process (t = 0-t = 8) and for three time 
points taken during spore germination (ger). 



Figure 3. Continued 

only) should be typed in the white bar and will be organized in a graphical network via the 'Add Cytoscape" option. Coloured nodes (shapes) and 
edges (lines) represent genes and connections as described for Figure 3A. Three-letter abbreviations at the edges indicate during which stage in 
sporulation the particular regulation is relevant. For SpoOA regulation, thicker edges represent high-threshold genes, whereas thin edges represent 
low-threshold genes (44). 
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menu bar that will allow the users to generate their own 
Cytoscape interaction figure of specific sporulation genes 
of interest (Figure 3B). 

Gene expression values during sporulation are visualized 
in informative heatmaps 

A large group of important sporulation genes has already 
been identified and characterized, although many remain 
whose function and/or regulation is still unknown. To 
further visualize timing and co-regulation of gene expres- 
sion during sporulation, we have displayed a recent sporu- 
lation-specific transcriptional dataset from Nicolas et al. 
(45) in colour-coded heatmaps. These heatmaps can be 
accessed via the 'Heatmap' option on the menu bar, or 
via the heatmap icon at the top of every State page. Genes 
are categorized in classes based on their previously docu- 
mented regulation. Their expression values during the 
complete course of sporulation as well as during three 
time points of germination are shown in colour-coded 
boxes (Figure 4). In this way, differences and similarities 
in expression between co-regulated genes are visible and 
can provide clues about possible function and/or regula- 
tion of uncharacterized genes. The heatmaps can be down- 
loaded via the Excel icon on the top right of the page. 

Concluding remarks and perspectives 

Knowledge on bacterial sporulation is rapidly growing, 
partly due to novel technological developments. This 
progress also reveals additional levels of complexity and 
makes it increasingly difficult to obtain a general under- 
standing, especially for non-specialists in the field. 
Furthermore, rapid advances in DNA and RNA 
sequencing technologies have enabled faster and cheaper 
access to genomic- and transcriptional data of a large 
number of bacterial species. This leads to an expansion 
of our knowledge from laboratory-adapted model 
bacteria, such as B. subtilis, to more industrially or med- 
ically relevant species and strains. The data reveal high 
levels of conservation of certain genes or regulatory 
modules on the one hand and highlight important differ- 
ences in gene presence/absence and regulatory events on 
the other, which have significant implications for the 
overall process of spore formation in specific groups of 
bacteria (23,24). The wealth of information that has 
been generated over decades by research on bacterial 
model organisms is extremely useful and usable as refer- 
ence material for those bacterial species for which genetic 
manipulation and in vivo validation is less straightforward. 

To provide researchers from all disciphnes and expertise 
levels an accessible overview of the current state of know- 
ledge on B. subtilis sporulation, we have constructed 
an interactive and graphical knowledge platform called 
SporeWeb. We believe that the potential of SporeWeb 
lies in the combination of a source of information on bac- 
terial sporulation, an accessible starting point for further 
detailed investigation and a dynamic platform that can be 
adjusted and supplemented as new relevant data become 
available in the future. Moreover, the user-friendly inter- 
face and intuitive organization provide comprehensible 
data acquisition for both specialists and non-specialists 



in the field. Future perspectives for SporeWeb can 
include expansion to 'sister' Web sites. These can 
contain similar content on sporulation from a proteomics 
point of view or of (for instance) members of the B. cereus 
group (including B. anthracis and Bacillus thuringiensis) 
and/or Clostridium species, which would be a constructive 
addition to the applications of SporeWeb and a valuable 
contribution of knowledge to the entire field of sporula- 
tion research. 

SUPPLEMENTARY DATA 

Supplementary Data are available at NAR Online. 
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